AITopics | Burlington County

Collaborating Authors

Burlington County

John Kirby grilled on mysterious New Jersey drone sightings: 'Why don't we know?'

FOX NewsDec-13-2024, 23:54:22 GMT

White House National Security Communications Advisor John Kirby responds to more questions over the aerial systems on'The Story.' White House National Security Communications Advisor John Kirby maintained that the government still lacks definitive answers regarding the nature of reported drone sightings as public frustration intensifies. "Many of the corroborated sightings have turned out to be piloted aircraft. I didn't say all of them, and what I said was those are the ones we were able to corroborate," Kirby said on "The Story." "There certainly is ones that we have not been able to, and we don't know the answer to it, and I strongly recommend that for folks that are seeing these things and documenting them to share that as they can with the Department of Homeland Security and the FBI." In a Wednesday letter to Biden, New Jersey Gov. Phil Murphy asked the president for more federal resources to address drone sightings, noting that the federal law limits the ability of state and local law enforcement to counter drones.

artificial intelligence, drone, john kirby, (9 more...)

FOX News

Country:

Europe > Jersey (0.71)
North America > United States > New Jersey > Monmouth County (0.08)
North America > United States > New Jersey > Ocean County (0.05)
North America > United States > New Jersey > Burlington County (0.05)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.51)

Add feedback

Query-oriented Data Augmentation for Session Search

Chen, Haonan, Dou, Zhicheng, Zhu, Yutao, Wen, Ji-Rong

arXiv.org Artificial IntelligenceJul-4-2024

Modeling contextual information in a search session has drawn more and more attention when understanding complex user intents. Recent methods are all data-driven, i.e., they train different models on large-scale search log data to identify the relevance between search contexts and candidate documents. The common training paradigm is to pair the search context with different candidate documents and train the model to rank the clicked documents higher than the unclicked ones. However, this paradigm neglects the symmetric nature of the relevance between the session context and document, i.e., the clicked documents can also be paired with different search contexts when training. In this work, we propose query-oriented data augmentation to enrich search logs and empower the modeling. We generate supplemental training pairs by altering the most important part of a search context, i.e., the current query, and train our model to rank the generated sequence along with the original sequence. This approach enables models to learn that the relevance of a document may vary as the session context changes, leading to a better understanding of users' search patterns. We develop several strategies to alter the current query, resulting in new training data with varying degrees of difficulty. Through experimentation on two extensive public search logs, we have successfully demonstrated the effectiveness of our model.

current query, query, search log, (16 more...)

arXiv.org Artificial Intelligence

2407.0372

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.28)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Wisconsin > Racine County (0.04)
(25 more...)

Genre: Research Report (1.00)

Industry: Education (1.00)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

Wang, Junjie, Chen, Mingyang, Hu, Binbin, Yang, Dan, Liu, Ziqi, Shen, Yue, Wei, Peng, Zhang, Zhiqiang, Gu, Jinjie, Zhou, Jun, Pan, Jeff Z., Zhang, Wen, Chen, Huajun

arXiv.org Artificial IntelligenceJun-20-2024

Improving the performance of large language models (LLMs) in complex question-answering (QA) scenarios has always been a research focal point. Recent studies have attempted to enhance LLMs' performance by combining step-wise planning with external retrieval. While effective for advanced models like GPT-3.5, smaller LLMs face challenges in decomposing complex questions, necessitating supervised fine-tuning. Previous work has relied on manual annotation and knowledge distillation from teacher LLMs, which are time-consuming and not accurate enough. In this paper, we introduce a novel framework for enhancing LLMs' planning capabilities by using planning data derived from knowledge graphs (KGs). LLMs fine-tuned with this data have improved planning capabilities, better equipping them to handle complex QA tasks that involve retrieval. Evaluations on multiple datasets, including our newly proposed benchmark, highlight the effectiveness of our framework and the benefits of KG-derived planning data.

query, str, subgraph query, (15 more...)

arXiv.org Artificial Intelligence

2406.14282

Country:

Africa > Namibia (0.14)
Asia > Vietnam (0.05)
Asia > China > Chongqing Province > Chongqing (0.04)
(7 more...)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

Translating Embeddings for Modeling Multi-relational Data

Neural Information Processing SystemsMar-13-2024, 15:10:51 GMT

We consider the problem of embedding entities and relationships of multirelational data in low-dimensional vector spaces. Our objective is to propose a canonical model which is easy to train, contains a reduced number of parameters and can scale up to very large databases. Hence, we propose TransE, a method which models relationships by interpreting them as translations operating on the low-dimensional embeddings of the entities. Despite its simplicity, this assumption proves to be powerful since extensive experiments show that TransE significantly outperforms state-of-the-art methods in link prediction on two knowledge bases. Besides, it can be successfully trained on a large scale data set with 1M entities, 25k relationships and more than 17M training samples.

transe, translation, triplet, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Bucks County (0.14)
North America > United States > New Jersey > Ocean County (0.14)
North America > United States > New Jersey > Atlantic County (0.14)
(10 more...)

Genre: Research Report (0.48)

Industry:

Media > Film (1.00)
Leisure & Entertainment > Sports (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.34)

Add feedback

GRAM: Global Reasoning for Multi-Page VQA

Blau, Tsachi, Fogel, Sharon, Ronen, Roi, Golts, Alona, Ganz, Roy, Avraham, Elad Ben, Aberdam, Aviad, Tsiper, Shahar, Litman, Ron

arXiv.org Artificial IntelligenceJan-7-2024

The increasing use of transformer-based large language models brings forward the challenge of processing long sequences. In document visual question answering (DocVQA), leading methods focus on the single-page setting, while documents can span hundreds of pages. We present GRAM, a method that seamlessly extends pre-trained single-page models to the multi-page setting, without requiring computationally-heavy pretraining. To do so, we leverage a single-page encoder for local page-level understanding, and enhance it with document-level designated layers and learnable tokens, facilitating the flow of information across pages for global reasoning. To enforce our model to utilize the newly introduced document-level tokens, we propose a tailored bias adaptation method. For additional computational savings during decoding, we introduce an optional compression stage using our C-Former model, which reduces the encoded sequence length, thereby allowing a tradeoff between quality and latency. Extensive experiments showcase GRAM's state-of-the-art performance on the benchmarks for multi-page DocVQA, demonstrating the effectiveness of our approach.

dataset, encoder, information, (14 more...)

arXiv.org Artificial Intelligence

2401.03411

Country:

Europe > Russia (0.14)
Asia > Russia (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
(107 more...)

Genre: Research Report (1.00)

Industry:

Law (1.00)
Government > Space Agency (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
(5 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset

Qi, Jiexing, Li, Shuhao, Guo, Zhixin, Huang, Yusheng, Zhou, Chenghu, Zhang, Weinan, Wang, Xinbing, Lin, Zhouhan

arXiv.org Artificial IntelligenceFeb-19-2023

Real-world data usually exhibits a long-tailed distribution,with a few frequent labels and a lot of few-shot labels. The study of institution name normalization is a perfect application case showing this phenomenon. There are many institutions worldwide with enormous variations of their names in the publicly available literature. In this work, we first collect a large-scale institution name normalization dataset LoT-insts1, which contains over 25k classes that exhibit a naturally long-tailed distribution. In order to isolate the few-shot and zero-shot learning scenarios from the massive many-shot classes, we construct our test set from four different subsets: many-, medium-, and few-shot sets, as well as a zero-shot open set. We also replicate several important baseline methods on our data, covering a wide range from search-based methods to neural network methods that use the pretrained BERT model. Further, we propose our specially pretrained, BERT-based model that shows better out-of-distribution generalization on few-shot and zero-shot test sets. Compared to other datasets focusing on the long-tailed phenomenon, our dataset has one order of magnitude more training data than the largest existing long-tailed datasets and is naturally long-tailed rather than manually synthesized. We believe it provides an important and different scenario to study this problem. To our best knowledge, this is the first natural language dataset that focuses on long-tailed and open-set classification problems.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2302.09509

Country:

North America > United States > New Jersey > Burlington County (0.15)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Ohio (0.05)
(6 more...)

Genre:

Research Report (0.50)
Workflow (0.46)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)

Add feedback

Natural Language Processing for Systems Engineering: Automatic Generation of Systems Modelling Language Diagrams

Zhong, Shaohong, Scarinci, Andrea, Cicirello, Alice

arXiv.org Artificial IntelligenceNov-19-2022

The design of complex engineering systems is an often long and articulated process that highly relies on engineers' expertise and professional judgment. As such, the typical pitfalls of activities involving the human factor often manifest themselves in terms of lack of completeness or exhaustiveness of the analysis, inconsistencies across design choices or documentation, as well as an implicit degree of subjectivity. An approach is proposed to assist systems engineers in the automatic generation of systems diagrams from unstructured natural language text. Natural Language Processing (NLP) techniques are used to extract entities and their relationships from textual resources (e.g., specifications, manuals, technical reports, maintenance reports) available within an organisation, and convert them into Systems Modelling Language (SysML) diagrams, with particular focus on structure and requirement diagrams. The intention is to provide the users with a more standardised, comprehensive and automated starting point onto which subsequently refine and adapt the diagrams according to their needs. The proposed approach is flexible and open-domain. It consists of six steps which leverage open-access tools, and it leads to an automatic generation of SysML diagrams without intermediate modelling requirement, but through the specification of a set of parameters by the user. The applicability and benefits of the proposed approach are shown through six case studies having different textual sources as inputs, and benchmarked against manually defined diagram elements.

artificial intelligence, natural language, text processing, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.knosys.2022.110071

2208.05008

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.05)
(8 more...)

Genre:

Research Report (1.00)
Workflow (0.93)

Industry: Health & Medicine (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.47)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.46)

Add feedback